We review the widespread notion that the inclusion of taxa scored for relatively few characters is problematic in phylogenetic analyses. Taxa scored for few characters may lead to lack of resolution, but need not. Lack of resolution may be unrelated to missing data when characters conflict. Missing data cannot produce groupings for which there is no evidence. A common approach to avoid the “missing data problem” is to exclude incomplete taxa, but excluding such taxa is inadvisable because the information content of taxa is not necessarily correlated with degree of completeness. Another prevalent strategy—excluding characters with a high proportion of missing data—may actually contribute to the low resolution problem rather than ameliorate it because removing any character data removes potentially informative synapomorphies. Other approaches, including the use of less-than-strict consensus techniques, have the potential to obscure evidence for alternative relationships or, at best, provide incomplete summaries of the primary trees. Missing data simply represent the unknown and should not be viewed as an impediment to considering all available evidence in phylogenetic analyses, nor used as justification for excluding specific taxa or characters.
How to translate text using browser tools
1 June 2003
PROBLEMS DUE TO MISSING DATA IN PHYLOGENETIC ANALYSES INCLUDING FOSSILS: A CRITICAL REVIEW
MAUREEN KEARNEY,
JAMES M. CLARK
ACCESS THE FULL ARTICLE
Journal of Vertebrate Paleontology
Vol. 23 • No. 2
June 2003
Vol. 23 • No. 2
June 2003